perm filename CHAP4[4,KMC]7 blob sn#018749 filedate 1973-01-05 generic text, type T, neo UTF8
00100	    		CHAPTER FOUR
00200		PROBLEMS FOR COMPUTER UNDERSTANDING OF NATURAL LANGUAGE 
00300		 IN TELETYPED PSYCHIATRIC INTERVIEWS
00400	
00500	
00600	     By `natural language` I shall mean everyday American English
00700	such as is used by readers of this book in ordinary conversations.
00800	It is still difficult to be explicit about the processes which
00900	enable hummans to interpret and respond to natural language.
01000	Philosophers, linguists and psychologists have 
01100	investigated natural language with various purposes and few
01200	useful results.  Now attempts are being made in artificial intelligence to write        
01300	algorithims which `understand' natural language expressions.     
01400	     During the 1960's when machine processing of natural language
01500	was dominated by syntactic considerations, it became clear that
01600	syntactical information alone was insufficient to comprehend the expressions of ordinary conversations.
01700	The current view is that to understand what is said in linguistic expressions, syntax and semantics
01800	must be combined with beliefs   from an underlying conceptual
01900	structure having an ability to draw inferences.
02000	How to achieve this combination efficiently with a large data-base represents a monumental task for
02100	both theory and implementation.
02200	     Since the behavior  being simulated by  our paranoid model is the
02300	language-behavior  of a paranoid patient in a psychiatric
02400	interview, the model must have an ability to interpret and respond to
02500	natural language input sufficient to demonstrate  language-behavior 
02600	characteristic of the paranoid mode. How language is understood
02700	depends on the intentions of the producers and interpreters in the dialogue.
02800	And language is understood in accordance with the game being played.
02900	Our purpose was to develop a method for understanding everyday English sufficient
03000	for the model to behave conversationally in a paranoid way in a
03100	circumscribed situation. 
03200	
03300	We did not try to construct a general-purpose algorithm which could understand anything
03400	said in English by anybody to anybody in any dialogue situation. (Is it possible?)
03500		We took as a pragmatic measure of "understanding" the ability of the algorithm
03600	to `get the message' of an expression by grasping the imperative intent
03700	of the interviewer ,i.e.what effect he is trying to bring about in the interpreter relative to the topic.
03800	This straightforward approach to a complex problem has its
03900	drawbacks, as will be shown, but we strove for an idiosyncratic idiolect
04000	sufficient to demonstrate paranoid processes of an individual in a particular situation rather than 
04100	for a general or standard-ideal comprehension of English. If the language-understanding
04200	process got in the way of demonstrating the paranoid processes, we would
04300	consider it defective and insufficient for our purposes.
04400	           (Insert from Machr here)
04500		The main  problems 
04600	a dialogue algorithm must cope with in a psychiatric interview will now be discussed.
04700	                  QUESTIONS
04800	
04900		The principal sentence-type used by an interviewer consists of
05000	a question. The usual wh-  and yes-no questions must be recognized by
05100	the language-analyzer. In teletyped interviews a question may sometimes be
05200	put in declarative form followed by a question mark as in:
05300	        (1) PT.-  I LIKE TO GAMBLE ON THE HORSES.
05400	            DR.- YOU GAMBLE?
05500	
05600	Particularly difficult are `when' questions which require a memory which
05700	can assign each event a beginning, end and a duration. Also troublesome
05800	are questions such as `how often', `how many', i.e. a `how' followed
05900	by a quantifier.
06000		In constructing a simulation of a thought process it is arbitrary
06100	how much information to represent in memory. Should the model
06200	know what is the capital of Alabama? We took the position that the model
06300	should know only what we believed it reasonable to know about a few
06400	hundred topics expectable in a psychiatric interview. Thus the model performs badly when subjected to baiting
06500	`exam' questions designed to test its limitations rather than to seek
06600	useful psychiatric information.
06700		     IMPERATIVES
06800	
06900		Typical imperatives in a psychiatric interview consist of 
07000	expressions like: 
07100		(2) DR.- TELL ME ABOUT YOURSELF.
07200		(3) DR.- LETS DISCUSS YOUR FAMILY.
07300	Such imperatives are equivalent to interrogatives
07400	about the topics they refer to. Since the only physical action the model
07500	can perform is to `talk' , imperatives should be treated as requests
07600	for information.
07700	              DECLARATIVES
07800	
07900		In this category I shall lump everything else. It includes
08000	greetings, farewells, yes-no type answers, existence assertions 
08100	and predications made upon a subject.
08200	Naturally negations must be checked for.
08300	
08400	              AMBIGUITIES
08500	
08600		Words have more than one sense, a convenience for human memories
08700	but a pain for language-analysing  algorithms. Consider the word `bug' in
08800	the following expressions:
08900		(4) AM I BUGGING YOU?
09000		(5) AFTER A PERIOD OF HEAVY DRINKING HAVE YOU FELT BUGS ON YOUR SKIN?
09100		(6) DO YOU THINK THEY PUT A BUG IN YOUR ROOM?
09200	In (4) the term `bug' means to annoy, in (5) it refers to an insect and
09300	in(6) it refers to a microphone used for survellience. Some common words
09400	like `run' have fifty or more common senses. Context must be used to
09500	carry out disambiguation, as described in 00.0. Also we have the advantage
09600	of an idiolect where we can arbitrarily restrict the word senses. One
09700	characteristic of the paranoid mode is that no matter in what sense the
09800	interviewer uses a word, the patients idiosyncratically interprets it
09900	in the sense relevant to his pathological malevolence beliefs.
10000	  		ANAPHORIC REFERENCES
10100	
10200		The common anaphoric references consist of the pronouns `it',
10300	`he', `him', `she', `her', `they', `them' as in:
10400		(7) PT.-HORSERACING IS MY HOBBY.
10500		(8) DR.-WHAT DO YOU ENJOY ABOUT IT?
10600	The algorithm must recognize that the 'it' refers to `horseracing'. More
10700	difficult is a reference more than one I/O pair back in the dialogue as in:
10800		(9) PT.-THE MAFIA IS OUT TO GET ME.
10900		(10) DR.- ARE YOU AFRAID OF THEM?
11000		(11) PT.- MAYBE.
11100		(12) DR.- WHY IS THAT?
11200	The `that' of (12) does not refer to (11) but to the topic of being
11300	afraid which the interviewer introduced in (10). Another pronominal
11400	confusion occurs when the interviewer uses `we' in two senses as in:
11500		(13) DR.- WE WANT YOU TO STAY IN THE HOSPITAL.
11600		(14) PT.- I WANT TO BE DISCHARGED NOW.
11700		(15) DR.- WE ARE NOT COMMUNICATING.
11800	In (13) the interviewer is using `we' to refer to psychiatrists or 
11900	the hospital staff while in (15) the term refers to the interviewer and patient.
12000	
15500	
15600	FRAGMENTS
15700	
15800	     Another major problem for algorithms which attempt to understand
15900	discourse consists of the fact that many of the input expressions
16000	are not well-formed.  All sorts of fragments and ellipses appear
16100	which must somehow be connected to conceptualizations under discussion.
16200	For example, consider the following exchange:
16300	
16400	   
16500	     {10} Dr. - `How do you like the hospital?'
16600	
16700	     {11} Pt. - `I shouldn't be here.'
16800	
16900	     {12} Dr. - `Why not?'
17000	The question {12} is an elliptical expression for the full concept
17100	ualization
17200	
17300	     `Why should you not be in the hospital?'
17400	
17500	     Junk words {`well now'} {`tell me more'} and go ahead signals
17600	must be responded to by continuation of a topic.
17700	
17800	For example:
17900	
18000	     {13} Pt.- `I went to the track last week.'
18100	
18200	     {14} Dr. - `Really?'
18300	
18400	Such expressions as {14} stand in a meta-relation to the topic and
18500	serve to keep the conversation going.
18600	
18700	REJOINDERS
18800	
18900	     Sometimes the input expression from the interviewer is a rejoinder
19000	, a reply to a reply by the patient.  For instance:
19100	
19200	     {15} Dr. - `Who are you afraid of?'
19300	
19400	     {16} Pt. - `The Mafia is out to get me.'
19500	
19600	     {17} Dr. - `I would be afraid of them also.'
19700	
19800	in which {17} is a rejoinder.  Such expressions are not requests for
19900	information but provide information for the patient's model of the
20000	interviewer.
20100	
20200	INTERVIEWER-INTERVIEWEE RELATIONS
20300	
20400	     It is characteristic of psychiatric interviewing that the
20500	participants from time to time do not simply talk about the
20600	patient.  Two situations exist concurrently in an interview,
20700	one being talked about and one the participants are in.  At
20800	times the second situation becomes the first.  When the partici
20900	pants discuss one another and their relation, the dialogue
21000	expressions contain intentional verbs which in English fit the
21100	pattern `I X you' or `you X me'.  The comprehension process must
21200	distinguish clearly between subject and object in the case of some
21300	of these verbs.  For example in
21400	
21500	     {18} `I like you'
21600	
21700	the speaker 'I' experiences the liking but in
21800	
21900	     {19} `Do I please you?'
22000	
22100	the `you' experiences the pleasure as a consequence of something
22200	`I' does.